LIMEtree: Interactively Customisable Explanations Based on Local Surrogate Multi-output Regression Trees
Systems based on artificial intelligence and machine learning models should
be transparent, in the sense of being capable of explaining their decisions to
gain humans' approval and trust. While there are a number of explainability
techniques that can be used to this end, many of them are only capable of
outputting a single one-size-fits-all explanation that simply cannot address
all of the explainees' diverse needs. In this work we introduce a
model-agnostic and post-hoc local explainability technique for black-box
predictions called LIMEtree, which employs surrogate multi-output regression
trees. We validate our algorithm on a deep neural network trained for object
detection in images and compare it against Local Interpretable Model-agnostic
Explanations (LIME). Our method comes with local fidelity guarantees and can
produce a range of diverse explanation types, including contrastive and
counterfactual explanations praised in the literature. Some of these
explanations can be interactively personalised to create bespoke, meaningful
and actionable insights into the model's behaviour. While other methods may
give an illusion of customisability by wrapping otherwise static explanations
in an interactive interface, our explanations are truly interactive, in the
sense of allowing the user to "interrogate" a black-box model. LIMEtree can
therefore produce consistent explanations on which an interactive exploratory
process can be built.
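The core idea can be illustrated with a minimal sketch (not the authors' implementation): perturb the explained instance in a binary interpretable representation (e.g., super-pixels switched on or off), query the black box for the full vector of class probabilities, and fit a single multi-output regression tree to all of them at once, weighting samples by their proximity to the explained point. The `black_box_proba` function and all parameters below are hypothetical placeholders.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
n_concepts = 8  # e.g., super-pixels of the explained image (hypothetical)
W = rng.normal(size=(n_concepts, 3))  # fixed weights of the toy black box

def black_box_proba(z):
    """Hypothetical black box: probabilities over 3 classes for binary concept vectors."""
    logits = z @ W
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

# Perturb the explained instance (all concepts present) in the interpretable space.
explained = np.ones((1, n_concepts))
samples = rng.integers(0, 2, size=(500, n_concepts)).astype(float)

# Weight perturbations by their overlap with the explained instance.
weights = samples.mean(axis=1)

# Fit ONE multi-output regression tree to the probabilities of ALL classes,
# rather than a separate linear surrogate per class as in LIME.
tree = DecisionTreeRegressor(max_depth=3)
tree.fit(samples, black_box_proba(samples), sample_weight=weights)

# The tree approximates the black box locally for every class at once, so
# contrastive questions ("why class 0 rather than class 2?") can be answered
# by inspecting a single structure.
print(tree.predict(explained))
```

Because one tree covers every class, contrastive and counterfactual queries can be answered by traversing the same local model instead of reconciling separate per-class surrogates.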
One Explanation Does Not Fit All: The Promise of Interactive Explanations for Machine Learning Transparency
The need for transparency of predictive systems based on Machine Learning
algorithms arises as a consequence of their ever-increasing proliferation in
the industry. Whenever black-box algorithmic predictions influence human
affairs, the inner workings of these algorithms should be scrutinised and their
decisions explained to the relevant stakeholders, including the system
engineers, the system's operators and the individuals whose case is being
decided. While a variety of interpretability and explainability methods is
available, none of them is a panacea that can satisfy all diverse expectations
and competing objectives that might be required by the parties involved. We
address this challenge in this paper by discussing the promises of Interactive
Machine Learning for improved transparency of black-box systems using the
example of contrastive explanations -- a state-of-the-art approach to
Interpretable Machine Learning.
Specifically, we show how to personalise counterfactual explanations by
interactively adjusting their conditional statements and extract additional
explanations by asking follow-up "What if?" questions. Our experience in
building, deploying and presenting this type of system allowed us to list
desired properties as well as potential limitations, which can be used to guide
the development of interactive explainers. While customising the medium of
interaction, i.e., the user interface comprising various communication
channels, may give an impression of personalisation, we argue that adjusting
the explanation itself and its content is more important. To this end,
properties such as breadth, scope, context, purpose and target of the
explanation have to be considered, in addition to explicitly informing the
explainee about its limitations and caveats...
Comment: Published in the Künstliche Intelligenz journal, special issue on Challenges in Interactive Machine Learning.
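A toy sketch of the interaction pattern described above, assuming a hypothetical loan-scoring black box over three discrete features; it is not the system built for the paper. The user personalises the counterfactual by restricting which features may change, and each follow-up "What if?" question is simply another call with a different set of mutable features.

```python
from itertools import product

# Hypothetical loan-scoring setting with three discrete features.
VALUES = {"income_band": [0, 1, 2, 3],
          "num_defaults": [0, 1, 2],
          "employment_years": [0, 1, 2]}

def model(x):
    """Hypothetical black box: approve (1) iff a simple score clears a threshold."""
    return int(x["income_band"] + x["employment_years"] - 2 * x["num_defaults"] >= 1)

def counterfactual(instance, mutable):
    """Smallest change over the user-selected *mutable* features that flips the
    prediction -- each call answers one personalised "What if?" question."""
    target = 1 - model(instance)
    best, best_cost = None, None
    for combo in product(*(VALUES[f] for f in mutable)):
        candidate = dict(instance, **dict(zip(mutable, combo)))
        cost = sum(candidate[f] != instance[f] for f in mutable)
        if model(candidate) == target and (best_cost is None or cost < best_cost):
            best, best_cost = candidate, cost
    return best

applicant = {"income_band": 1, "num_defaults": 2, "employment_years": 1}
# Initial explanation: the user only allows past defaults to change ...
print(counterfactual(applicant, mutable=["num_defaults"]))
# ... follow-up "What if?": keep the defaults, adjust income and employment instead.
print(counterfactual(applicant, mutable=["income_band", "employment_years"]))
```

Restricting the mutable features is what turns a static counterfactual into an interactive, user-steered one: the explainee, not the explainer, decides which conditional statements are on the table.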
Towards Faithful and Meaningful Interpretable Representations
Interpretable representations are the backbone of many black-box explainers.
They translate the low-level data representation necessary for good predictive
performance into high-level human-intelligible concepts used to convey the
explanation. Notably, the explanation type and its cognitive complexity are
directly controlled by the interpretable representation, which makes it possible to target a
particular audience and use case. However, many explainers that rely on
interpretable representations overlook their merit and fall back on default
solutions, which may introduce implicit assumptions, thereby degrading the
explanatory power of such techniques. To address this problem, we study
properties of interpretable representations that encode presence and absence of
human-comprehensible concepts. We show how they are operationalised for
tabular, image and text data, discussing their strengths and weaknesses.
Finally, we analyse their explanatory properties in the context of tabular
data, where a linear model is used to quantify the importance of interpretable
concepts.
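The presence/absence encoding is easiest to see for text, even though the paper's quantitative analysis concerns tabular data. A minimal sketch, with a hypothetical keyword-based sentiment scorer standing in for the black box: each binary component of the interpretable representation records whether a token of the explained sentence is kept, and a weighted linear surrogate quantifies the importance of each token-level concept.

```python
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)

# Interpretable representation for text: one binary component per token of the
# explained sentence, encoding whether the token is present (1) or removed (0).
sentence = "the service was slow but the food was great".split()
n_tokens = len(sentence)

def black_box(texts):
    """Hypothetical sentiment model returning a positivity score per text."""
    return np.array([t.count("great") - t.count("slow") for t in texts], dtype=float)

def to_text(mask):
    """Map a binary concept vector back to raw data by dropping absent tokens."""
    return " ".join(tok for tok, keep in zip(sentence, mask) if keep)

# Sample in the interpretable (binary) space and query the black box on the
# corresponding raw texts.
masks = rng.integers(0, 2, size=(200, n_tokens))
scores = black_box([to_text(m) for m in masks])

# Weight samples by how many tokens they share with the full sentence and use a
# linear surrogate to quantify the importance of each interpretable concept.
weights = masks.mean(axis=1)
linear = Ridge(alpha=1.0).fit(masks, scores, sample_weight=weights)
for token, importance in zip(sentence, linear.coef_):
    print(f"{token}: {importance:+.2f}")
```

The `to_text` mapping is where the implicit assumptions live: what "absence" of a concept means (deleting a token, greying out a super-pixel, imputing a mean value) is a choice of the interpretable representation, not of the surrogate model.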
(Un)reasonable Allure of Ante-hoc Interpretability for High-stakes Domains: Transparency Is Necessary but Insufficient for Explainability
Ante-hoc interpretability has become the holy grail of explainable machine
learning for high-stakes domains such as healthcare; however, this notion is
elusive, lacks a widely-accepted definition and depends on the deployment
context. It can refer to predictive models whose structure adheres to
domain-specific constraints, or ones that are inherently transparent. The
latter notion assumes observers who judge this quality, whereas the former
presupposes them to have technical and domain expertise, in certain cases
rendering such models unintelligible. Additionally, its distinction from the
less desirable post-hoc explainability, which refers to methods that construct
a separate explanatory model, is vague given that transparent predictors may
still require (post-)processing to yield satisfactory explanatory insights.
Ante-hoc interpretability is thus an overloaded concept that comprises a range
of implicit properties, which we unpack in this paper to better understand what
is needed for its safe deployment across high-stakes domains. To this end, we
outline model- and explainer-specific desiderata that allow us to navigate its
distinct realisations in view of the envisaged application and audience.
bLIMEy: Surrogate Prediction Explanations Beyond LIME
Surrogate explainers of black-box machine learning predictions are of
paramount importance in the field of eXplainable Artificial Intelligence since
they can be applied to any type of data (images, text and tabular), are
model-agnostic and are post-hoc (i.e., can be retrofitted). The Local
Interpretable Model-agnostic Explanations (LIME) algorithm is often mistakenly
unified with a more general framework of surrogate explainers, which may lead
to a belief that it is the solution to surrogate explainability. In this paper
we empower the community to "build LIME yourself" (bLIMEy) by proposing a
principled algorithmic framework for building custom local surrogate explainers
of black-box model predictions, including LIME itself. To this end, we
demonstrate how to decompose the surrogate explainers family into
algorithmically independent and interoperable modules and discuss the influence
of these component choices on the functional capabilities of the resulting
explainer, using the example of LIME.
Comment: 2019 Workshop on Human-Centric Machine Learning (HCML 2019); 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada.
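A compact sketch of this modular decomposition, with hypothetical module interfaces rather than the paper's (or any library's) actual API: an interpretable-representation module, a data-sampling module and an explanation-generation module are composed into a local surrogate explainer, and swapping only the last module turns a LIME-like sparse linear explainer into a tree-based one.

```python
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.tree import DecisionTreeRegressor

# Three algorithmically independent, interoperable modules (hypothetical interfaces).

def interpretable_representation(raw, thresholds):
    """Module 1: map raw feature vectors onto binary human-readable concepts."""
    return (raw > thresholds).astype(float)

def data_sampling(reference, scale, n, rng):
    """Module 2: augment the data in the neighbourhood of the explained point."""
    return reference + rng.normal(scale=scale, size=(n, reference.size))

def explanation_generation(concepts, targets, weights, surrogate):
    """Module 3: fit any weighted surrogate model in the interpretable space."""
    return surrogate.fit(concepts, targets, sample_weight=weights)

# --- composing the modules into two different local surrogate explainers ---
rng = np.random.default_rng(7)
reference = np.array([52.0, 31.5, 140.0])    # instance being explained
thresholds = np.array([50.0, 30.0, 130.0])   # concept boundaries, e.g. "age > 50"

def black_box(x):  # hypothetical predictor under scrutiny
    return 1 / (1 + np.exp(-(0.03 * x[:, 0] + 0.05 * x[:, 1] + 0.02 * x[:, 2] - 7)))

raw = data_sampling(reference, scale=[5.0, 3.0, 10.0], n=500, rng=rng)
concepts = interpretable_representation(raw, thresholds)
ref_concepts = interpretable_representation(reference[None, :], thresholds)
weights = np.exp(-np.abs(concepts - ref_concepts).sum(axis=1))

# Swapping only the third module changes which member of the surrogate-explainer
# family results: a sparse linear surrogate is LIME-like, a regression tree is not.
lime_like = explanation_generation(concepts, black_box(raw), weights, Lasso(alpha=0.01))
tree_based = explanation_generation(concepts, black_box(raw), weights,
                                    DecisionTreeRegressor(max_depth=3))
print(lime_like.coef_, tree_based.get_depth())
```

Each module choice (how concepts are defined, how the neighbourhood is sampled, which surrogate is fitted) independently shapes the functional capabilities of the resulting explainer, which is the decomposition the paper argues should be made explicit instead of defaulting to LIME's choices.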